Immersive Interactive Data Mining and Machine Learning Algorithms for Big Data Visualization

نویسنده

  • Mohammadreza Babaee
چکیده

The amount of collected Earth Observation (EO) images is increasing exponentially and their growth is currently in the order of several terabytes per day. Therefore, the ability to automatically store and retrieve these images based on their content is highly desired. Traditional approaches are not accurate and robust enough to handle this massive amount of data. However, the combination of artificial intelligence and human intelligence could deliver promising results. Therefore, this thesis addresses several challenges in the field of human-machine communication for data mining applications. This is mainly done by first introducing an Immersive Visual Data Mining (IVDM) system, including image collections and feature space visualizations, interactive dimensionality reduction, and active learning for image classification. A Cave Automatic Virtual Environment (CAVE) is employed to support the user-image interactions and also immersive data visualization, which allows the user to navigate through the images and explore them. The feature space is visualized by applying state-of-the-art dimensionality reduction techniques to reduce the dimensionality to 3D. Additionally, a novel algorithm based on Non-negative Matrix Factorization (NMF) is developed to arrange the images in 3D space by decreasing the occlusion among images and to make use of the display space more efficiently. Two interactive dimensionality reduction algorithms are introduced to enhance the discriminative property of the features by incorporating the user-image interactions. To annotate images, a novel active learning algorithm is proposed to choose the most informative images for labeling. Finally, experimental evaluations using publicly available data sets demonstrate the efficiency of the proposed algorithms.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards better analysis of machine learning models: A visual analytics perspective

Interactive model analysis, the process of understanding, diagnosing, and refining a machine learning model with the help of interactive visualization, is very important for users to efficiently solve real-world artificial intelligence and data mining problems. Dramatic advances in big data analytics has led to a wide variety of interactive model analysis tasks. In this paper, we present a comp...

متن کامل

Big Data Algorithms for Visualization and Supervised Learning

Explosive growth in data size, data complexity, and data rates, triggered by emergence of high-throughput technologies such as remote sensing, crowd-sourcing, social networks, or computational advertising, in recent years has led to an increasing availability of data sets of unprecedented scales, with billions of high-dimensional data examples stored on hundreds of terabytes of memory. In order...

متن کامل

Beyond Data Mining: Integrative Machine Learning for Health Informatics

Machine learning (ML) aims at learning from data to gain knowledge from experience and to make predictions. ML is the most growing field in computer science and health is amongst the greatest application challenges. Health systems worldwide are confronted with big data, hence algorithms that improve through experience can be of great help and automatic ML (aML) approaches show impressive result...

متن کامل

Interactive Data Mining and Visualization

Interactive analysis introduces dynamic changes in Visualization. On another hand, advanced visualization can provide different perspectives of the data to the user, hence, provide effective way of data mining. This paper discusses new ideas for interactive data mining tool based on R through HCI techniques. Also demonstrates the purposed features through data mining and visualization examples,...

متن کامل

An Interactive Machine Learning Framework

Machine learning (ML) is believed to be an effective and efficient tool to build reliable prediction model or extract useful structure from an avalanche of data. However, ML is also criticized by its difficulty in interpretation and complicated parameter tuning. In contrast, visualization is able to well organize and visually encode the entangled information in data and guild audiences to simpl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016